Using Linguistic Data for English and Spanish Verb-Noun Combination Identification
نویسندگان
چکیده
We present a linguistic analysis of a set of English and Spanish verb+noun combinations (VNCs), and a method to use this information to improve VNC identification. Firstly, a sample of frequent VNCs are analysed in-depth and tagged along lexico-semantic and morphosyntactic dimensions, obtaining satisfactory inter-annotator agreement scores. Then, a VNC identification experiment is undertaken, where the analysed linguistic data is combined with chunking information and syntactic dependencies. A comparison between the results of the experiment and the results obtained by a basic detection method shows that VNC identification can be greatly improved by using linguistic information, as a large number of additional occurrences are detected with high precision.
منابع مشابه
Multilingual Corpus-based Approach to the Resolution of English -ing
Corpus data has proven to be useful for dealing with ambiguities in NLP. A number of studies, for example, have deal with disambiguating English PP attachments, using corpus data (Hindle and Rooth (1993), Brill and Resnik (1994), Steina and Nagao (1997), Ratnaparkhi (1998), and Pantel and Lin (2000), among others). This paper explores a novel approach to resolving ambiguities associated with –i...
متن کاملAn Investigation of the Linguistic, Paralinguistic and Sociocultural Effects of Input on the Perception and Translation of Gerunds by Persian Speakers of English
In this study, it was intended to investigate the Persian native speakers’ perception of gerunds by three different elicitation techniques i.e., written, audio, and pictorial through translation. Eighty intermediate learners of English were asked to select Persian translation of the gerund formsin these elicitation techniques. They were asked to choose one option from a pair of written first la...
متن کاملWhat's in the input? Frequent frames in child-directed speech offer distributional cues to grammatical categories in Spanish and English.
Recent analyses have revealed that child-directed speech contains distributional regularities that could, in principle, support young children's discovery of distinct grammatical categories (noun, verb, adjective). In particular, a distributional unit known as the frequent frame appears to be especially informative (Mintz, 2003). However, analyses have focused almost exclusively on the distribu...
متن کاملObject and Action Naming: A Study on Persian-Speaking Children
Objectives: Nouns and verbs are the central conceptual linguistic units of language acquisition in all human languages. While the noun-bias hypothesis claims that nouns have a privilege in children’s lexical development across languages, studies on Mandarin and Korean and other languages have challenged this view. More recent cross-linguistic naming studies on children in German, Turkish,...
متن کاملCross-Linguistic Differences in a Picture-Description Task Between Korean- and English-Speaking Individuals With Aphasia.
Purpose The purpose of the study was to examine cross-linguistic differences in a picture-description task between Korean- and English-speaking individuals with Broca's and anomic aphasia to determine whether a variation exists in the use of verbs and nouns across the language and aphasia groups. Method Forty-eight individuals (male = 29; female = 19) participated in the study (n = 28 for aph...
متن کامل